(54) Automatic image data quality adjustment to reduce response time of a Web server



(57) The present invention provides methods, de- 
vices and systems for dynamically adjusting transcod- 
ing parameters so as to increase the benefits of trans- 
coding. Methods of adaptation are designed to cope 
with the variability of network characteristics and of the 
size of transcoded images. The invention also provides 
a method and apparatus to enable the transcoding 
proxy to adjust a quality-size tradeoff on a per-image
and/or a per-client basis. The adaptive transcoder
chooses different parameters for each object, and pro- 
vides performance improvements. The invention further 
provides a general framework for making policy deci- 
sions taking into account available bandwidth, content 
and type of image, and user preferences. The invention 
also includes methods for generating feedback about 
the choice of optimal transcoding parameters to the us- 
er. 
Description
[...] only when a high compression ratio can be achieved. [...] transcoding does not help at all. On the contrary,
it makes things worse due to the store and forward [...]
data over low-speed wide-area wireless links, like Cellular Digital Packet Data (CDPD) 163 that provides 10 kb/s or
less of throughput, depending upon the number of clients sharing the link. The end client could be a full-function PC 
134 with a good color display so that the primary problem is bandwidth reduction. Alternatively, the client might be one of the
small, web-enabled mobile devices 130-132, in which case it is advantageous to tailor the web data for the specific
client device, particularly the client's display characteristics. Different aspects of transcoding proxy design benefit different
scenarios (i.e., tariffed proxy-client link, strong/weak client display, etc.), but all are handled well within the same
proxy architecture 100. 

[0012] Figure 2 shows a block diagram of an embodiment of transcoding proxy 190 used to transform objects based
on user specified preferences and static policies. A transcoding proxy 190 is built by combining a transcoding module
240 with an HTTP proxy engine 220. An HTTP request 222 originates from a client 230 and is forwarded 224 by the
proxy 190 to a web server 210. The response data 226 (i.e., HTML pages and GIF and JPEG images) are transformed
by the transcoder 240 and then forwarded 228 to the client 230. Typically, a number of transcoding parameters are
specified to the transcoder 240 in order to achieve the desired quality/size reduction of the object contained in the
response data 226. Transcoding proxies in use today either use a static set of policies 250 or use some form of user
specified preferences 260 via path 265 to determine the transcoding parameters. When a fixed set of transcoding
parameters is applied to all objects, results are not always beneficial. In fact, in many cases, transcoding leads to
poorer performance.

[0013] Accordingly, in a first aspect, the present invention provides a method for a transcoding proxy to facilitate 
browsing between a plurality of client devices and a plurality of servers connected via a communication network, the
method comprising: receiving a request from one of the client devices for an object stored at one of the servers, forwarding
the request for the object to said one of the servers, receiving the object from said one of the servers, examining
preferences specified by a user of said one of the client devices, examining contents of the object, examining commu- 
nication network characteristics, choosing a set of transcoding parameters, forming a transcoded form of the object, 
and sending the transcoded form to said one of the clients. 

[0014] Preferably, the network characteristics include bandwidth, and examining network characteristics includes
estimating network bandwidth between said one of the servers and the proxy as well as between the proxy and said 
one of the clients. 

[0015] Preferably, the network characteristics include delay and the step of examining network characteristics includes
estimating delay between said one of the servers and the proxy as well as delay between the proxy and said
one of the clients.

[0016] The method preferably further comprises providing feedback to the user about a level of transcoding per- 
formed on the object to form the transcoded form. 

[0017] Preferably, the step of examining includes determining the size of the object. 

[0018] In the method as described the object may be of type image forming an image object, and the method further 
comprises: determining dimensions of the image object, and calculating the compression ratio of the image object.
Further to be preferred is that the dimensions of the image object are determined by area of the image in square pixels, 
and the compression ratio is determined by the bpp ratio of the image object. 
[0019] The step of forming a transcoded form preferably employs dynamic adaptation. 

[0020] The step of forming a transcoded form is preferably started before the step of receiving the object from said 
one of the servers is complete. The received object type is preferably of type JPEG forming a JPEG object. Further to
be preferred is that the step of forming a transcoded form includes performing JPEG-to-JPEG image transcoding and 
the step of sending the transcoded form starts writing out at least one MCU of JPEG-encoded output image data before 
the step of receiving the object is complete. 

[0021] In the method as described, the step of sending out the transcoded form is preferably started after processing 
an initial fraction of the received object, and before the step of receiving the image object from said one of the servers
is complete. 

[0022] The step of sending out the transcoded form is preferably started before the step of forming a transcoded 
form of the object is complete. 

[0023] In a second aspect, the present invention provides a method for a proxy to form a transcoded form of an 
object received from a server in satisfaction of a request from a client for an object available from the server, the method
comprising dynamically adapting parameters for transcoding the object for the client, forming a transcoded form of the 
object, and sending the transcoded form to the client. The step of adapting parameters preferably includes determining 
at least one characteristic of the object. One characteristic is preferably an object-header, the object-header providing 
information about the size and the type of the object. The method preferably further comprises comparing the size of 
the object to a threshold parameter called "size_threshold". The step of adapting parameters preferably includes gathering
present network characteristics between the server and the proxy and between the proxy and the client. One of
the characteristics is preferably network bandwidth and the step of adapting includes estimating network bandwidth 
between the server and the proxy as well as between the proxy and the client. 
[0024] In the method as described, the transcoded form is preferably dependent upon the estimated bandwidth. The
step of adapting also preferably includes retrieving preferences of the user, and the transcoded form is dependent
upon the preferences. The step of adapting preferably includes examining the contents of the object. The
object is preferably of type image forming an image object, and examining the content of the image object includes 
determining dimensions of the image. 

[0025] In the method as described, the step of adapting is preferably dependent upon determining the compression 
ratio of the image object. The type of image object is further preferably GIF, and the step of adapting is dependent
upon comparing the compression ratio against a predetermined policy threshold called "gif_threshold".
[0026] The method as described preferably further comprises predicting at least one parameter of the transcoded
form of the object. The transcoded form is further preferably the same as an original form of the object. Preferably at
least one of the parameters includes a size of the transcoded form. Preferably also, at least one of the parameters
includes the time spent in transcoding the object.

[0027] In a third aspect, the present invention provides a method for predicting parameters of a transcoded form of 
an object, the object having an initial size and dimension, and the object being received from a server in satisfaction 
of a request from a client for the object, the method comprising: computing the bpp ratio of the object, gathering a set
of statistics of a plurality of previously transcoded objects, and employing the set of statistics and the bpp ratio for
predicting the parameters.
[0028] In a method as described, at least one of the parameters is preferably size and the set of statistics includes
sizes of a plurality of previously transcoded objects.

[0029] The object is preferably of type image, and the set of statistics includes image quality. The plurality of previously
transcoded objects are preferably chosen from a predetermined benchmark suite of images.

[0030] In a method as described, the step of employing preferably includes dynamically updating the set of statistics
using the statistics of the currently transcoded object. At least one of the parameters is preferably a duration for forming
a transcoded form of an object and the set of statistics includes the duration for forming a plurality of previously trans- 
coded objects. 
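The prediction method of the third aspect can be illustrated with a minimal sketch. It takes bpp to mean bits per pixel (file size in bits divided by image area in pixels) and assumes, purely for illustration, that the statistics are kept as a running mean of the output-to-input size ratio per bpp bucket. The class name `SizePredictor`, the bucket width, and the bucketing scheme itself are hypothetical and are not taken from the specification.

```python
from collections import defaultdict
from typing import Optional


def bits_per_pixel(size_bytes: int, area_pixels: int) -> float:
    """File size in bits divided by image area in pixels."""
    return size_bytes * 8 / area_pixels


class SizePredictor:
    """Predicts transcoded size from statistics of earlier transcodings.

    Hypothetical design: images are grouped into bpp buckets, and each
    bucket keeps the running mean of (output size / input size).
    """

    def __init__(self, bucket_width: float = 0.5) -> None:
        self.bucket_width = bucket_width
        self._ratio_sum = defaultdict(float)
        self._count = defaultdict(int)

    def _bucket(self, bpp: float) -> int:
        return int(bpp / self.bucket_width)

    def update(self, in_bytes: int, area_pixels: int, out_bytes: int) -> None:
        """Fold the result of a just-transcoded object into the statistics."""
        b = self._bucket(bits_per_pixel(in_bytes, area_pixels))
        self._ratio_sum[b] += out_bytes / in_bytes
        self._count[b] += 1

    def predict(self, in_bytes: int, area_pixels: int) -> Optional[float]:
        """Return the predicted output size in bytes, or None if no data."""
        b = self._bucket(bits_per_pixel(in_bytes, area_pixels))
        if self._count.get(b, 0) == 0:
            return None
        return in_bytes * self._ratio_sum[b] / self._count[b]
```

Calling `update` after every transcoding corresponds to the dynamic updating of the statistics described above; seeding the predictor from a benchmark suite would simply mean calling `update` once per benchmark image before serving requests.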

[0031] In a fourth aspect, the present invention provides a transcoding proxy system for facilitating browsing between 
a plurality of clients and a plurality of servers connected via a communication network, the proxy comprising an HTTP 
proxy engine to receive a request from one of the clients for an object stored at one of the servers and to fetch the 
object from said one of the servers, an object transcoder to form a transcoded form of the object using a set of param- 
eters for transcoding, a dynamic policy module to determine the set of parameters of transcoding, an image size and 
delay predictor module to gather characteristics of the object, a user preference module to gather quality preferences 
specified by a user of said one of the clients, and a bandwidth estimation module to estimate available network bandwidth,
wherein the dynamic policy module dynamically adjusts the parameters of transcoding using the input received
from the image size and delay predictor module, user preferences module, and bandwidth estimation module for the
purpose of improving satisfaction for the user, and the transcoding system provides feedback to the user about the
level of transcoding performed.

[0032] The user preferences module preferably further collects the characteristics such as display size, resolution,
and CPU speed of said one of the devices, and provides those characteristics to the dynamic policy module.
[0033] In the system as described, the bandwidth estimation module preferably collects traces of previously established
network connections between said one of the servers and the proxy, collects traces of previously established
network connections between the proxy and said one of the clients, and estimates the object download time by per- 
forming statistical analysis on the collected traces. 

[0034] The statistical analysis used for estimating bandwidth between said one of the servers and the proxy is pref- 
erably based on computing a statistical measure such as median, mean, or mode of download times of previously 
fetched objects as determined from the collected traces. 
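A minimal sketch of such a trace-based estimate follows. It assumes each collected trace reduces to a pair of (bytes transferred, seconds elapsed) and takes the median of the per-trace throughputs rather than of the raw download times, so that objects of different sizes can be compared; that normalization, and the function names, are illustrative assumptions and not part of the specification.

```python
from statistics import median
from typing import List, Optional, Tuple

# One trace per previously fetched object: (bytes transferred, seconds elapsed).
Trace = Tuple[int, float]


def estimate_bandwidth_bps(traces: List[Trace]) -> Optional[float]:
    """Median throughput in bits per second over the collected traces."""
    rates = [nbytes * 8 / secs for nbytes, secs in traces if secs > 0]
    return median(rates) if rates else None


def estimate_download_seconds(nbytes: int, traces: List[Trace]) -> Optional[float]:
    """Predicted download time for an object of nbytes, or None without traces."""
    bandwidth = estimate_bandwidth_bps(traces)
    if bandwidth is None:
        return None
    return nbytes * 8 / bandwidth
```

The median is used here because it is less sensitive than the mean to a few unusually slow or fast transfers; the mean or mode named in the text could be substituted by changing one call.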

[0035] The statistical analysis used for estimating bandwidth between the proxy and said one of the clients is pref- 
erably based on computing aggregate bandwidth of all active connections between the proxy and said one of the 
clients. The system preferably further comprises displaying a slide bar on said one of the client's display for collecting 
the user specified preferences. In a system as described, the user of said one of the clients can preferably specify the 
tradeoff between download time and data quality through the use of a graphical user interface with a slide bar. The
user of said one of the clients can preferably specify through the use of a graphical user interface with a slide bar the 
tradeoff between download time and image quality including a specific switch to select color or gray scale as the desired 
output form. The user of said one of the clients can further preferably specify through the use of a graphical user 
interface with a slide bar, the desire to maintain a target response time such that the system automatically reduces 
data quality (and hence data download size) to compensate for dynamic variations in bandwidth to said one of the 
clients. The graphical user interface slider bar is also preferably used as an output interface for showing the optimal
choice of transcoding parameters to the user. 

[0036] In a fifth aspect, the present invention provides an article of manufacture comprising a computer usable medium
having computer readable program code means embodied therein for causing dynamic adaptation of transcoded
form of an object in a transcoding proxy, the computer readable program code means in said article of manufacture 
comprising computer readable program code means for causing a computer to effect: a proxy receiving an object 
associated with a user from a server, determining parameters of the object, retrieving preferences of the user, gathering 
present network characteristics, obtaining transcoding policy thresholds, performing a policy decision based upon object
parameters, user preferences, network characteristics, and policy thresholds, forming a transcoded object, providing
feedback of a level of transcoding performed on the object to the user, and sending the transcoded object to the
user. 

[0037] In a sixth aspect the present invention provides an article of manufacture comprising a computer usable 
medium having computer readable program code means embodied therein for causing a transcoding proxy to facilitate
browsing between a plurality of client devices and a plurality of servers connected via a communication network, the 
computer readable program code means in said article of manufacture comprising computer readable program code 
means for causing a computer to effect: receiving a request from one of the client devices for an object stored at one 
of the servers, forwarding the request for the object to said one of the servers, receiving the object from said one of
the servers, examining preferences specified by a user of said one of the client devices, examining contents of the
object, examining communication network characteristics, choosing a set of transcoding parameters, forming a trans- 
coded form of the object, and sending the transcoded form to said one of the clients. 

[0038] The computer readable program code means in said article of manufacture preferably further comprises computer
readable program code means for causing a computer to effect providing feedback to the user about a level of
transcoding performed on the object to form the transcoded form. The computer readable program code means in said
article of manufacture preferably further comprises computer readable program code means for causing a computer 
to effect determining dimensions of the object, and calculating the compression ratio of the object. The computer 
readable program code means in said article of manufacture preferably further comprises computer readable program 
code means for causing a computer to effect starting the step of forming a transcoded form before the step of receiving
the object from said one of the servers is complete.

[0039] In a seventh aspect, the present invention provides a computer program product comprising a computer
usable medium having computer readable program code means embodied therein for causing a proxy to form a trans- 
coded form of an object received from a server in satisfaction of a request from a client for an object available from 
the server, the computer readable program code means in said computer program product comprising computer readable
program code means for causing a computer to effect: dynamically adapting parameters for transcoding the object
for the client, forming a transcoded form of the object, and sending the transcoded form to the client. 
[0040] The computer readable program code means in said computer program product preferably further comprises 
computer readable program code means for causing a computer to effect gathering present network characteristics 
between the server and the proxy and between the proxy and the client. The computer readable program code means
in said computer program product preferably further comprises computer readable program code means for causing
a computer to effect adapting parameters for transcoding based upon the estimated bandwidth and preferences of the 
user. 

[0041] In an eighth aspect, the present invention provides a program storage device readable by machine, tangibly 
embodying a program of instructions executable by the machine to perform method steps for predicting parameters 
of a transcoded form of an object, the object having an initial size and dimension, and the object being received from
a server in satisfaction of a request from a client for the object, said method steps comprising: computing the bpp ratio 
of the object, gathering a set of statistics of a plurality of previously transcoded objects, and employing the set of statistics
and the bpp ratio for predicting the parameters. 

[0042] Said method step of employing the set of statistics preferably further comprises updating the set of statistics 
using the statistics of the currently transcoded objects.

[0043] An aspect of the present invention is to provide methods, devices and systems for dynamically adjusting 
transcoding parameters so as to increase the benefits of transcoding. Methods of adaptation are designed to cope 
with the variability of network characteristics and of the size of transcoded images. 

[0044] In an embodiment, the invention includes three new components: an image size predictor, a network bandwidth
(b/w) analyzer, and a policy module. Before initiating any transcoding action, the policy module queries the image
size predictor to estimate the size of the output image. The b/w analyzer is queried to collect an estimate of the image 
transmission time from the server to the proxy, and from the proxy to the client. Based on the collected estimates, the 
proxy decides whether or not to transcode the image. In addition, the policy module can also compute the optimal point 
in the quality vs. size tradeoff that would provide the user specified performance criteria (e.g., reduced response time, 
increased quality).
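The transcode-or-not decision of this embodiment can be sketched as follows. The rule used here is an assumption made for illustration, since this passage does not state a formula: transcoding is taken to pay off when the predicted transcoding delay plus the time to send the predicted output over the proxy-to-client link is shorter than the time to send the original over that link. The names `Estimates` and `transcoding_pays_off` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Estimates:
    original_bytes: int          # size of the object as received from the server
    predicted_bytes: float       # output size from the image size predictor
    transcode_seconds: float     # predicted time spent transcoding
    client_bandwidth_bps: float  # proxy-to-client estimate from the b/w analyzer


def transcoding_pays_off(e: Estimates) -> bool:
    """Return True if transcoding is expected to reduce response time.

    Assumption: the server-to-proxy transfer time is the same on both
    paths in a simple store-and-forward model, so it is left out.
    """
    send_original = e.original_bytes * 8 / e.client_bandwidth_bps
    send_transcoded = (
        e.transcode_seconds + e.predicted_bytes * 8 / e.client_bandwidth_bps
    )
    return send_transcoded < send_original
```

With a 10 kb/s client link, a 50,000-byte image predicted to shrink to 10,000 bytes is worth half a second of transcoding; with a 10 Mb/s link the same half second costs more than it saves.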

[0045] Another aspect of the present invention is to provide a method and/or apparatus to enable the transcoding 
proxy to adjust a quality-size tradeoff on a per-image and/or a per-client basis. The adaptive transcoder chooses dif- 
ferent parameters for each object, and provides performance improvements. 
[0046] Still another aspect of this invention provides a general framework for making policy decisions taking into
account available bandwidth, content and type of image, and user preferences. The administrator of the proxy chooses 
from a variety of optimization objectives so as to obtain improved performance from the system. In one embodiment,
when the proxy transcoder throughput is the bottleneck, the policy module is instructed to judiciously use CPU resources 
so as to reduce the response time for all users. An advantageous element of the invention is the automated nature of 
decision making, to free up users from actively controlling the policy engine of the proxy. 

[0047] In still another aspect of the present invention a method is provided for generating feedback about the choice 
of optimal transcoding parameters to the user. In an example embodiment, the transcoding system provides feedback
to the user by dynamically adjusting the position of the user preference slider bar. The user preference slider bar serves 
both as an input as well as an output device. 

[0048] A preferred embodiment of the present invention will now be described by way of example, with reference to
the drawings in which:

FIG. 1 shows an example transcoding proxy scenario with varied links and varied client devices; 

FIG. 2 shows a block diagram of a transcoding proxy used to transform objects based on user specified preferences 
and static policies; 

FIG. 3 shows a block diagram of an example of a transcoding proxy modified in accordance with the present
invention to include an image size predictor, bandwidth estimator, dynamic policy module, and a user feedback
generator;

FIG. 4 shows an example block diagram of an example HTTP proxy with caching and transcoding modules in 
accordance with the present invention; 

FIG. 5 shows an example flow diagram of a transcoding dynamic policy module in accordance with the present 
invention;

FIG. 6 shows an example web request-response cycle using a transcoding proxy in accordance with the present 
invention;

FIG. 7 shows an example regime where transcoding is useful in accordance with the present invention; 

FIG. 8 shows an example flow diagram of an example policy function in accordance with the present invention;

FIG. 9 shows an example block diagram of an example image size prediction module in accordance with the 
present invention; 

FIG. 10 shows an example block diagram of an example bandwidth prediction module in accordance with the 
present invention; 

FIG. 11 shows input/feedback user interface in accordance with the present invention; and 

FIG. 12 illustrates an example timing diagram of streaming transcoding in accordance with the present invention.

[0049] Figure 3 shows a block diagram 300 of an example embodiment of changes to transcoding proxy 190 in 
accordance with the present invention. A comparison of Figure 3 to Figure 2 shows a replacement of the static policy 
module, 250 in Figure 2, with a dynamic policy module, 370 in Figure 3. A purpose of the dynamic policy module 370 
is to make decisions concerning when to turn transcoding on and off and what transcoding policy (i.e., the transcoding
algorithm along with its parameters) to use. The dynamic policy module 370 also interfaces with an image size and 
delay predictor 375, a bandwidth estimator 380, and a user feedback provider 390. In the embodiment shown, the
policy module 370 employs a number of criteria, including: the characteristics of the data (e.g., size of the images,
current encoding efficiency, structural role in the HTML page) as determined by the content analysis flow diagram
(shown in Figure 5), the current estimate of the bandwidth on the proxy-to-client and server-to-proxy links (shown in 
Figure 10), the characteristics of the client, particularly the client display capabilities, and the user preferences con- 
cerning the preferred rendering of the data (shown as the user slide bar preferences in Figure 11). 
[0050] The items shown in Figures 5, 10 and 11 are described below. In particular, the user slide bar preferences of 
Figure 11 provide a method of interacting with the transcoding proxy so as to dynamically change the tradeoff between
image quality and download time. In addition to serving as an input interface, the slider bar (1140, 1150, 1160 of Figure
11) also acts as an output interface, displaying the feedback 390 that is received from the dynamic policy module 370. 
[0051] Figure 4 shows a block diagram 400 illustrating an example function of a multi-resolution cache 410 in a 
transcoding HTTP proxy system 400 in accordance with the present invention. Caches are useful in HTTP proxies to
provide reduced response time for repeated data requests (by the same or different clients) for the same data object.
A variety of methods may be used to assure that the cached data is up-to-date. In the example caching transcoding 
system 400, the multi-resolution cache 410 stores an original version of the data, as well as other possible forms of 
the data, including reduced resolution versions that have been transcoded for specific device types. 
[0052] As an example, we again consider the case of an image data object, but it is understood by those skilled in
the art that these methods can be applied to other data types. We describe a method for storing, tagging and retrieving
various forms of the data in the context of a caching transcoding proxy system. 

[0053] Referring to Figure 4, it is assumed that a JPEG image is received from the server 210 in response to a 
request by a client 230. This image is decoded from the lossy JPEG encoding format to a bit-map representation of 
the image. Those skilled in the art of image processing recognize that the JPEG encoding standard describes images
using coefficients that weight Discrete Cosine Transforms (DCTs) so a compute-intensive decoding step is generally
required to obtain the actual colors of each pixel. This bit-map requires a larger storage size than the original JPEG 
image. However, given an adequate size cache it may be worthwhile to store (via data path 420) this expanded size 
image so as to avoid the compute-intensive step of JPEG decoding when a transcoded version of the same image is 
later required but with different transcoding parameters than the first request. In a second step, based on the transcoding
parameters (i.e., color depth, scaling factor, and the JPEG quality factor) the JPEG image is re-encoded. The image
may be re-encoded as a JPEG image or in an alternative encoding format. The final transcoded version of the image 
is also stored in the multi-resolution cache 410 via data path 420. 

[0054] In the example embodiment, when additional data requests occur, the HTTP proxy 220 first checks its cache 
410 to see if an "up-to-date" version of the data object is available at the requested resolution or transcoding level.
Each object in the multi-resolution cache 410 is stored with a version specifier that includes: a URL description, a time
stamp for the data object, and the object characteristics. For a JPEG image the object characteristics include color-depth,
scaling factor, and JPEG quality factor. An alternative embodiment has an indication that the JPEG image has
been decoded and stored in its bit-map form, or that it has been converted and stored as a GIF with various characteristics.

[0055] If an up-to-date version of the object is available with the requested type and resolution, then that version is
returned to the client. If this is not available, but an up-to-date version of the object exists in either its original JPEG 
form or the decoded bit-map form, then this version is returned to the transcoder 240 with an indication of its charac- 
teristics. This enables the transcoder 240 to produce the desired version of the object, which is returned to the client 
230 and stored in the multi-resolution cache 410 via data path 420. 
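The lookup order described in the preceding paragraphs can be sketched as follows. The `Version` fields mirror the version specifier in the text (URL plus object characteristics, with the time stamp stored alongside the data). The freshness test (a time stamp no older than a caller-supplied `not_before` value) and the choice to prefer the decoded bit-map over the original JPEG as the transcoder's input are illustrative assumptions; the class and method names are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple


@dataclass(frozen=True)
class Version:
    url: str
    form: str                           # "original", "bitmap", or "transcoded"
    color_depth: Optional[int] = None   # bits per pixel of the stored form
    scale: Optional[float] = None       # scaling factor
    jpeg_quality: Optional[int] = None  # 1 to 100


class MultiResolutionCache:
    def __init__(self) -> None:
        # Maps a version specifier to (time stamp, data).
        self._store: Dict[Version, Tuple[float, bytes]] = {}

    def put(self, version: Version, timestamp: float, data: bytes) -> None:
        self._store[version] = (timestamp, data)

    def _fresh(self, version: Version, not_before: float) -> Optional[bytes]:
        entry = self._store.get(version)
        if entry is not None and entry[0] >= not_before:
            return entry[1]
        return None

    def lookup(self, wanted: Version, not_before: float) -> Tuple[str, Optional[bytes]]:
        """Return ("hit", data), ("source", data) or ("miss", None)."""
        data = self._fresh(wanted, not_before)
        if data is not None:
            return ("hit", data)        # return directly to the client
        # Assumed preference: the bit-map form avoids a JPEG decoding step.
        for form in ("bitmap", "original"):
            data = self._fresh(Version(wanted.url, form), not_before)
            if data is not None:
                return ("source", data)  # hand to the transcoder
        return ("miss", None)            # fetch from the server
```

A "source" result corresponds to the case where the transcoder produces the requested version from a cached form, after which the proxy would call `put` with the new version.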

[0056] It is noted that there are several extensions to this scheme that are obvious to those skilled in the art. One
extension uses an already transcoded version of the object, rather than the original data object, to generate a
further resolution-reduced version of the object. Methods for managing the different resolution versions of data objects 
in a cache are further described in R. O. LaMaire and J. T. Robinson, "Conserving Storage Space by Means of Low 
Resolution Objects", docket Y0997308, US patent application filed February 13, 1998. 

[0057] Figure 5 shows an example flow diagram for the dynamic policy module 370 of Figures 3 and 4. Figure 5 also
shows how the policy module 370 interfaces with the HTTP proxy engine 220 and the object transcoder 240 of Figure 
3. The methods described below apply to many content types including text, images, audio, and video. However, the 
following discussion focuses on image data types only. It is evident to those skilled in the art that concepts and dynamic 
policies are applicable to other media types. 

[0058] Figure 5 shows that based on the response header received from the server 210, the HTTP proxy engine 220
first determines the size of the object 510. If the content-type of the response is "image/*" 520, the proxy engine 220
passes the handle for the object to the dynamic policy module 370 for further analysis. Inside the policy module 370,
the size of the input object is compared against a preconfigured threshold called "size_threshold" 530. If the object
is smaller than "size_threshold", or the content type is not "image/*" 520, then the object is not transcoded, but instead
is forwarded 515 to the client without any content modification. Small objects (such as bullets, thumbnails, logos, etc.)
found on the web are typically GIF objects which are already well compressed due to GIF encoding. Transcoding such 
objects does not generally yield further compression. 

[0059] We convert GIF images to GIF or JPEG images that are reduced in size and/or color-depth (the choice of 
GIF or JPEG as the end format depends on the image characteristics). In addition, we convert JPEG images to JPEG 
images that are reduced in JPEG quality, size, and/or color-depth. JPEG quality refers to a transcoding parameter that
is used to determine the degree to which the coefficients of the Discrete Cosine Transformations used in the JPEG 
encoding standard are quantized. It has been found that the JPEG quality parameter is also a good predictor of perceived
image quality. This parameter varies in the range 1 to 100, where 100 represents very high quality. JPEG images



EP 0 992 922 A2 



found on the web typically have a JPEG quality parameter of 75. 

[0060] For large images, the type of the image coding (JPEG or GIF) and the efficiency of coding are important 
factors in transcoding decision making. Since JPEG is a lossy compression method, size reduction is always possible 
by reducing the quality factor. Similar quality reduction, however, cannot be applied to GIF files since GIF is a lossless
compression method. To achieve quality reduction, a GIF file must first be decoded and then re-encoded as a quality- 
reduced JPEG image. This method, however, may not always provide size reduction. GIF format is usually more efficient 
for coding maps, logos, and drawings while JPEG is more efficient for coding natural images. Converting GIF to JPEG 
is useful only when the original GIF image is not efficiently coded. 

[0061] We define bits per pixel (bpp) as a measure of the compression efficiency. Bpp is computed as the ratio of the
image file size to the image area in pixels. In processing step 540, the X and Y dimensions and the bpp value of the input
image are determined. If the input object is of type "image/jpg" 550, transcoding is always
performed. However, if the content type is "image/gif" 525, only those objects which yield a bpp ratio larger than
gif_threshold 535 are transcoded. GIF files that are not very efficiently encoded yield a bpp value that is larger than
gif_threshold. Thus, the decision step 535 is very effective in identifying compressible GIF files with high accuracy.
Though not shown in Figure 5, it is evident to those skilled in the art that other transcoding policies, such as scaling and
file truncation (for progressively encoded data), can be used for well-compressed GIFs.
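
As an illustration only, the content-type and bpp test of steps 525-550 can be sketched in Python as follows. The function names are hypothetical, the file size is assumed to be given in bytes and converted to bits, and the threshold passed in the usage note is an arbitrary illustrative value, since the description does not state a value for gif_threshold.

```python
def bits_per_pixel(file_size_bytes: int, width: int, height: int) -> float:
    """Compression efficiency: image file size in bits divided by image area in pixels."""
    if width <= 0 or height <= 0:
        raise ValueError("image dimensions must be positive")
    return (file_size_bytes * 8) / (width * height)


def should_transcode(content_type: str, file_size_bytes: int,
                     width: int, height: int, gif_threshold: float) -> bool:
    """Return True when the object is a candidate for transcoding.

    JPEG objects are always transcoded; GIF objects only when their bpp
    exceeds gif_threshold (an assumed, caller-supplied value).
    """
    if content_type in ("image/jpg", "image/jpeg"):
        return True
    if content_type == "image/gif":
        return bits_per_pixel(file_size_bytes, width, height) > gif_threshold
    # Other content types are outside the scope of this sketch.
    return False
```

For example, should_transcode("image/gif", 4000, 100, 80, gif_threshold=2.0) evaluates a 4000-byte, 100 x 80 GIF at 4 bpp and therefore selects it for transcoding, whereas a 1000-byte GIF of the same dimensions (1 bpp) is forwarded unmodified.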

[0062] An important aspect of the proposed invention is that decision steps 510-565 are carried out as soon as the
image header is received. If the decision is to not transcode, image segments can be forwarded as soon as they are
received from the server without incurring store-and-forward delay. Similarly, when transcoding is to be performed,
images can either be buffered and then transcoded (store-and-forward transcoding), or each segment can be transcoded
on-the-fly (streaming transcoding method).

[0063] After identifying an image that is compressible, the next step involves determining the extent to which the 
selected image should be transcoded. The policy function 565 is responsible for collecting input from three different 
sources (image size predictor 375, bandwidth estimator 380, user preference selector 260) and subsequently selecting 
transcoding parameters in accordance with the steps shown in Figure 8. The chosen parameters determine the extent 
and types of compression performed by the object transcoder 240. For example, the scaling parameter determines 
how much an image is downsampled. Quantization parameters control how an image is quantized in the pixel domain 
and/or the frequency domain. The number of colors in a color mapped image can be reduced, or a 24-bit color image 
may be converted to 8-bit grayscale, or even a monochrome representation. The process of transcoding is performed 
in step 570 and the output of the transcoder is forwarded to the client 230.

[0064] An important aspect of the policy function 565 is the analytical framework for making transcoding decisions.
The analytical framework takes into consideration factors such as available bandwidth, type and size of the image, and
user preferences, and provides objective criteria for making transcoding decisions. As an example, we consider the
objective of minimizing response time for the user, but it is understood by those skilled in the art that other
optimization criteria can also be applied using the same framework. We describe a method for determining when it is beneficial
to transcode, and to what extent transcoding should be applied. The embodiment described herein is referred to as
dynamic adaptation of transcoding parameters.

[0065] Figure 6 shows an example web request-response cycle and the response time of fetching an object of size
S through a store-and-forward transcoding proxy 190. We define a store-and-forward image transcoder as an image
transcoder which must wait to accumulate an entire input image before transcoding can begin on this image and then
must wait to generate a transcoded image in its entirety before it is made available to be output. As shown in Figure
6, the original image 620 of size S (bytes) 670 is downloaded into the store-and-forward proxy over the server-proxy
connection with effective bandwidth B_sp (bits/sec). The transcoder introduces a delay D_p(S) 650 and generates an
output image 630 of size S_p(S) 680. Both the transcoding delay 650 and the output image's byte size 680 are denoted as
dependent upon the input image's byte size S 670. The transcoded image is then transmitted over a proxy-client
connection having effective bandwidth B_pc.

[0066] The policy function needs to weigh the cost (delay) of transcoding against any size reduction achieved by
transcoding. For transcoding to provide benefits, delay introduced due to transcoding must be offset by the reduction
in transmission time due to compression. For very low bandwidth proxy-client access links, the reduction in response
time due to aggressive image compression typically far outweighs the addition to response time caused by compute-intensive
transcoding. However, Figure 7 shows that as the bandwidth of the proxy-client link increases, there comes
a point (transcoding threshold 710) at which it is no longer beneficial to transcode, since the reduction in response time
due to aggressive compression decreases as a function of the bottleneck link's bandwidth, while the transcoding time
remains constant.

[0067] Suppose R_o is the response time of fetching a web object of size S from the web server with transcoding
turned off. Similarly, let R_p denote the response time of fetching the transcoded version of the same web object through
the transcoding proxy. For the purpose of the following discussion we assume that caching is not supported at the proxy.
[0068] The client-perceived response time with transcoding turned off is the sum of the following three terms:

R_o = 2 * RTT_pc + 2 * RTT_sp + S / min(B_pc, B_sp)

[0069] RTT_pc is the network round-trip time latency between the client and the proxy and, similarly, RTT_sp is the
latency between the proxy and the server. Fetching the web object requires a TCP SYN/ACK exchange as well as an
HTTP request/response, thereby contributing 2 * RTT_pc + 2 * RTT_sp to the delay term. In addition, a web image incurs
a transmission delay equal to the spread in time between arrival of its first and last bits. Let min(B_pc, B_sp) denote
the bottleneck bandwidth between the client and the server. In the absence of a proxy, the first and last bits of an
image will be spread in time by S / min(B_pc, B_sp). This spread corresponds to the effective transmission time of the
image over the concatenated server-to-proxy-to-client connection.

[0070] When transcoding is turned on, the proxy operates in a store-and-forward mode. 2 * RTT_pc + 2 * RTT_sp is
again the fixed component of the response time. D_p(S) is the additional term that represents the transcoding delay.
The resulting response time for the transcoded object can be expressed as:

R_p = 2 * RTT_pc + 2 * RTT_sp + D_p(S) + S/B_sp + S_p(S)/B_pc

Transcoding will reduce response time if R_p < R_o. That is,

D_p(S) + S/B_sp + S_p(S)/B_pc < S / min(B_pc, B_sp)

Clearly, when B_pc > B_sp, R_p is always greater than R_o. On the other hand, when B_pc < B_sp, transcoding is useful if and
only if:

D_p(S) + S/B_sp < {S - S_p(S)} / B_pc
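
The two response-time expressions and the resulting test can be sketched in Python as follows. This is a minimal illustration, not a definitive implementation: the function and parameter names are hypothetical, and sizes are assumed to be expressed in bits and bandwidths in bits per second so that each S/B term is in seconds.

```python
def response_time_direct(s_bits: float, rtt_pc: float, rtt_sp: float,
                         b_pc: float, b_sp: float) -> float:
    """R_o: response time with transcoding turned off."""
    return 2 * rtt_pc + 2 * rtt_sp + s_bits / min(b_pc, b_sp)


def response_time_transcoded(s_bits: float, sp_bits: float, d_p: float,
                             rtt_pc: float, rtt_sp: float,
                             b_pc: float, b_sp: float) -> float:
    """R_p: response time through a store-and-forward transcoding proxy."""
    return 2 * rtt_pc + 2 * rtt_sp + d_p + s_bits / b_sp + sp_bits / b_pc


def transcoding_helps(s_bits: float, sp_bits: float, d_p: float,
                      b_pc: float, b_sp: float) -> bool:
    """True when R_p < R_o; the round-trip terms cancel on both sides."""
    return d_p + s_bits / b_sp + sp_bits / b_pc < s_bits / min(b_pc, b_sp)
```

With assumed values S = 800,000 bits, S_p(S) = 160,000 bits, D_p(S) = 1 second and B_sp = 1 Mb/s, the test succeeds for a 28.8 kb/s proxy-client link and fails for a 10 Mb/s proxy-client link, in agreement with the observation that transcoding cannot help when B_pc > B_sp.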

[0071] The above equation precisely characterizes the regime in which transcoding reduces response time. Figure
8 shows a flow diagram of an example policy function constructed using the analytical framework described above.
810 marks the entry point of the policy function. The policy function 800 is called from 565 with the original image object
size as one of the inputs to the policy function. Variable S is set equal to the input object size and the quality factor q
is initialized to the best possible initial image quality (for example, the quality of the input image). In step 830, the policy
function 800 issues a query to the bandwidth estimator 380 asking for the estimated download time of an object of size
S from the specified server to the proxy, referred to as T_1(S). It also asks for the estimate of the download time for the
same object from the proxy to the client, referred to as T_2(S). Based on the logs of previous connections to the chosen
destination, the bandwidth estimator returns an estimate of T_1(S) and T_2(S). In the next step 840, the policy function
queries the image size & delay predictor 375 to find an estimate of the transcoded image size S_p(S). It then computes
the estimated download time savings by subtracting the download time estimate of the transcoded image size, T_2(S_p(S)),
from T_2(S) 850. Finally, in step 870 the two quantities (transcoding delay + T_1(S) - the download savings) and the
target response time reduction are compared. If the first term is less than the second term, computation is stopped
and the chosen quality factor q is returned as an output 880. Otherwise, the loop 840-870 is reentered with a reduced
q value.
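
The loop 840-870 can be sketched in Python as follows. This is one possible reading rather than the definitive flow of Figure 8: the stopping test is written as "net response time reduction reaches the target", where the net reduction is the download savings minus the transcoding delay minus T_1(S). The callables t1, t2, predict_size and predict_delay stand in for the bandwidth estimator 380 and the predictor 375; their signatures, the step size, the lower bound on q, and the None result when no quality factor meets the target are all assumptions of this sketch.

```python
from typing import Callable, Optional


def choose_quality(size: float,
                   t1: Callable[[float], float],
                   t2: Callable[[float], float],
                   predict_size: Callable[[float, int], float],
                   predict_delay: Callable[[float], float],
                   target_reduction: float,
                   q_start: int = 75,
                   q_min: int = 5,
                   q_step: int = 5) -> Optional[int]:
    """Lower q from q_start until the estimated net reduction meets the target."""
    q = q_start
    while q >= q_min:
        sp = predict_size(size, q)          # step 840: estimated transcoded size
        savings = t2(size) - t2(sp)         # step 850: proxy-to-client time saved
        reduction = savings - predict_delay(size) - t1(size)
        if reduction >= target_reduction:   # step 870 (assumed form of the test)
            return q                        # step 880: chosen quality factor
        q -= q_step                         # re-enter the loop with a reduced q
    return None                             # no q in range meets the target
```

Because the search starts at the highest quality and stops at the first value that meets the target, it returns the best quality consistent with the requested reduction under the assumed predictors.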

[0072] It is noted that more efficient search techniques or variants of objective functions can be designed by those
skilled in the art without departing from the spirit of the policy function framework presented in this invention. One
extension would be to rearrange the terms in the policy equation as follows:

Response time reduction(q) = {S - S_p(S)} / B_pc - D_p(S) - S/B_sp

[0073] In the above equation, S_p(S) is also a function of the quality factor (the smaller the output size, the poorer the
quality). Note that the transcoding delay has been found to be effectively independent of the quality factor. There are
two independent variables in the above equation: q, the quality factor, and the target response time reduction. Several
different policies can be developed within the framework of the above equation. For example:

Minimize response time for all users;

Maximize quality for a user-specified response time constraint;

Optimize overall system performance, not just one user.






[0074] Since transcoding is a compute-intensive task and the proxy only has limited CPU power, the proxy's CPU
[...] tasks that maximize overall user satisfaction. During the peak period, if every
user turns transcoding on, there may not be sufficient CPU cycles available to satisfy all transcoding requests. By
[...] the responsibility of parameter selection to the policy function 800, the system can [...] the
[...] parameters and provide consistent and predictable response to the end user. If users were to
[...], the transcoding system could not [...] the optimal performance [...].

[0075] Let us now consider the transcoding delay and size estimation. The decision step 870 requires two additional
inputs: an estimate of the size of the transcoded image, S_p(S), and the transcoding delay D_p(S) [...].
An accurate prediction of these two quantities is difficult to obtain since [...].
[...] The present invention uses statistical methods to estimate these two quantities.

[0076] [...] block diagram of an example of the image size prediction [...] in accordance with the present invention.
[...] In 910, we initialize our system with statistical characterizations of [...] that are based on a [...]
benchmark [...]
as it fetches and transcodes new image objects. The function of 920 enables the image size predictor 375 to adapt its
behavior to different data sets that a client may be accessing.

[0077] Thus, in the method described in 930, we use statistical methods to estimate the delay and output size and
use the bpp ratio as the criteria to perform a more accurate classification of images and [...]
transcoded [...] size. This method works particularly well for GIF files.

[0078] [...] D_p(S) can be estimated based on benchmark initialization and on-line updates
of the statistical model. We have also determined from our statistical work that the transcoding delay is well
approximated as being a linear function of the number of pixels (as opposed to file size).
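
A linear delay model of this kind can be fitted from benchmark measurements with ordinary least squares. The following Python sketch is illustrative only: the function names are hypothetical, and ordinary least squares is an assumed choice, since the description states only that the delay is modeled as a linear function of the number of pixels.

```python
from typing import List, Tuple


def fit_delay_model(samples: List[Tuple[int, float]]) -> Tuple[float, float]:
    """Fit delay = slope * pixels + intercept from (pixels, delay_seconds) samples."""
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples")
    mean_x = sum(x for x, _ in samples) / n
    mean_y = sum(y for _, y in samples) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in samples)
    if sxx == 0:
        raise ValueError("samples must cover more than one image area")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in samples)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def predict_delay(model: Tuple[float, float], width: int, height: int) -> float:
    """Estimate D_p for an image of the given dimensions."""
    slope, intercept = model
    return slope * width * height + intercept
```

On-line updating, as described for the statistical model, could be approximated by appending each newly measured (pixels, delay) pair to the sample list and refitting.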

[0079] Now we consider bandwidth and download time estimation. The accuracy of decision step 870 also depends
to a large extent on the accuracy of image download time estimates. Bandwidth estimator 380, shown in Figures
3-5, is responsible for providing these time estimates. The bandwidth estimator 380 records every packet send and
receive event at the proxy and performs statistical analysis on the collected traces to make predictions about the state
of the network. Those skilled in the art of network monitoring recognize that the network monitoring function can be
implemented in several different ways. One implementation would be a shim layer between the proxy application and
the socket layer. Another implementation would be a processing layer above any packet filtering software. Generally,
building over the packet filtering software will provide more accurate estimates of bandwidth and delay.
The present invention presents no requirement that the bandwidth monitor 380 and the policy decision step
870 should reside on the same machine, though co-locating the two modules minimizes the overhead of communication.

[0080] Figure 10 shows an example bandwidth estimator module 380 in accordance with the present invention. The
trace monitor monitors all connections between the clients and the proxy. The same trace monitor (or another instance of it) is also
used to monitor traffic between the proxy and all web servers. For each monitored connection, the following are recorded:

time when a new connection is established,

source address, source port number and destination address, destination port number of each network connection,

number of bytes sent and received on each connection and the respective timestamp of each send and receive event,

time when connections are closed.
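
One way to hold the per-connection information listed above is a small record type. The Python sketch below is hypothetical: the class name, field names, and the "send"/"recv" direction labels are illustrative choices and are not taken from the description.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class ConnectionTrace:
    """Trace of one monitored connection (all names are illustrative)."""
    src_addr: str
    src_port: int
    dst_addr: str
    dst_port: int
    opened_at: float                      # time the connection was established
    closed_at: Optional[float] = None     # time the connection was closed, if any
    # Each event is (timestamp, number of bytes, "send" or "recv").
    events: List[Tuple[float, int, str]] = field(default_factory=list)

    def record(self, timestamp: float, num_bytes: int, direction: str) -> None:
        if direction not in ("send", "recv"):
            raise ValueError("direction must be 'send' or 'recv'")
        self.events.append((timestamp, num_bytes, direction))

    def total_bytes(self, direction: str) -> int:
        return sum(n for _, n, d in self.events if d == direction)
```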

[0081] The statistical analyzer 1020 maintains a database of all past and all current active connections. For each
connection, it keeps a history of when and how many bytes were sent and/or received on that connection. Based
on this history, the statistical analyzer 1020 is able to make predictions about the download time of future connections.
For predicting server-to-proxy download time and proxy-to-client download time we use different heuristics.
[0082] Server-to-proxy traces exhibit behavior that is common among wide-area TCP connections. It has been observed
that sequence vs. timestamp traces are non-linear, and we have found that linear estimation techniques such as
average or median bandwidth do not yield accurate estimates. To predict the download time of an object of a given size,
the bandwidth estimator looks at the distribution
of the download time for all objects that have roughly the same size. The median, or some other appropriate statistical
function of this distribution, is returned as the download time estimate.
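
The server-to-proxy heuristic can be sketched in Python as follows. The function name is hypothetical, and the interpretation of "roughly the same size" as a relative tolerance of 25% is an assumption made purely for illustration.

```python
from statistics import median
from typing import List, Optional, Tuple


def estimate_server_download_time(history: List[Tuple[float, float]],
                                  size: float,
                                  tolerance: float = 0.25) -> Optional[float]:
    """Median download time of past objects whose size is within +/- tolerance of size.

    history holds (object_size, download_time_seconds) pairs from earlier
    server-to-proxy transfers. Returns None when no comparable object exists.
    """
    low, high = size * (1 - tolerance), size * (1 + tolerance)
    similar = [t for s, t in history if low <= s <= high]
    if not similar:
        return None
    return median(similar)
```

Another statistic of the same distribution, such as a high percentile for a conservative estimate, could be substituted for the median.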

[0083] In contrast, proxy-to-client TCP behavior is dominated by the effects of having a bandwidth-constrained link,
which is typically the last hop 160-164. Because there is a bottleneck link, the aggregate of all active TCP connections
to a client typically saturates the bottleneck link. Thus, for providing proxy-to-client download time estimates, the bandwidth
estimator aggregates all active connections to a client into a single group. For each group of connections it plots
time vs. number of bytes and then performs a linear curve-fit on the data. This technique provides an accurate estimate of
the current available bandwidth between the proxy and the client. Although this value changes with time, the oscillations
in most cases are bounded. For example, the bandwidth monitor can easily detect whether the client is connected by a
10 Mb Ethernet, 14.4 modem, or 28.8 modem by analyzing the output of the curve fitting algorithm. The predicted
values are not exact (20-30% deviation from the correct values), but those skilled in the art recognize that the accuracy
of bandwidth estimation can be improved by refining the estimation algorithm.
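
The proxy-to-client heuristic can be sketched in Python as a least-squares line through cumulative bytes versus time, whose slope is the estimated bandwidth. The function names are hypothetical, and fitting cumulative byte counts with ordinary least squares is an assumed concrete form of the "linear curve-fit" mentioned above.

```python
from typing import List, Tuple


def estimate_client_bandwidth(events: List[Tuple[float, int]]) -> float:
    """Estimate proxy-to-client bandwidth in bits per second.

    events holds (timestamp_seconds, bytes_sent) pairs pooled from all
    active connections to one client.
    """
    if len(events) < 2:
        raise ValueError("need at least two send events")
    times: List[float] = []
    cumulative: List[int] = []
    total = 0
    for timestamp, num_bytes in sorted(events):
        total += num_bytes
        times.append(timestamp)
        cumulative.append(total)
    n = len(times)
    mean_t = sum(times) / n
    mean_b = sum(cumulative) / n
    var_t = sum((t - mean_t) ** 2 for t in times)
    if var_t == 0:
        raise ValueError("timestamps must not all be equal")
    cov = sum((t - mean_t) * (b - mean_b) for t, b in zip(times, cumulative))
    return 8.0 * cov / var_t              # slope in bytes/s converted to bits/s


def estimate_client_download_time(size_bytes: float, bandwidth_bps: float) -> float:
    """T_2 for an object of the given size at the estimated bandwidth."""
    return 8.0 * size_bytes / bandwidth_bps
```

For a client receiving 1800 bytes per second across all of its connections, the fit yields 14,400 bits per second, which corresponds to the 14.4 modem case mentioned above.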

[0084] Consideration is now given to the transcoding user interface. The transcoding user interface is the interface
through which the user can dynamically change the tradeoff between image quality and download time. It is represented
as item 260 in Figures 2-5. Figure 11 shows an example implementation of the user preference interface in accordance
with the current invention. This user interface is represented well as a linear slide bar, which can be implemented
as a Java applet, to allow users to continuously vary the image quality. As shown in Figure 11, an option for
selecting color (vs. gray scale) is also provided. Further, an option for selecting autopilot can be used in the following
two ways: 1) as an indication to the proxy that it can automatically decide whether or not to perform transcoding, based
on the current estimates of the client-proxy and proxy-server link bandwidths and currently available CPU resources
as discussed above, or 2) as an indication to the proxy to try to maintain the download times that the user is
currently experiencing (i.e., by varying image quality at the proxy to compensate for dynamic bandwidth changes that
are detected by the bandwidth estimation subsystem), or both 1) and 2). On occasions when the user requires a full-fidelity
version of the data, they can easily request the refined data by adjusting the slide bar (downwards) and requesting
the higher quality data. Typically, we find that almost all of our web browsing can be done by transcoding images
to a sixth or tenth of their original size, and only rarely do we actually find an object, such as a map, for which we need to
request a refinement. Note that alternative HTML-only interfaces have been invented for supporting non-Java
devices such as Windows CE based platforms.

[0085] Now we consider streaming image transcoding. The above description is generally applicable to a store-and-forward
transcoding proxy. In the following, we describe conditions under which it is beneficial for a streamed image transcoder
to engage in transcoding. A streamed image transcoder is an image transcoder which starts writing out image
data encoded in an output format before having fully read in the complete input stream of bytes corresponding to the
entire image encoded in the input format.

[0086] In Figure 12 we illustrate an example of the reasoning behind our algorithm by using a timing diagram. The
input image arrives as a stream of bits spaced apart by 1/B_sp. The streaming image transcoder will take a group of G
bits for transcoding, incurring a small store-and-forward delay D_1. The group of bits is then transcoded into a group
of G_p output bits, incurring a delay D_2. If D_2 < D_1, then the image transcoder can convert each input group G to its
corresponding output group G_p before the next input group G needs to be processed. In this case, the streaming image
transcoder's internal memory requirement is bounded. However, if D_2 > D_1, then the image transcoder will not be able
to process input bits fast enough. In this latter case, given a continuous input stream, the image transcoder's
internal memory requirement grows without bound, i.e. the image transcoder's finite-length internal RAM buffers will
overflow. Therefore, we desire that the transcoding delay D_2 satisfy D_2 < D_1. Clearly, D_1 = G/B_sp. To find D_2, let D_p(S)
= the predicted image transcoding time for an image of S bits (D_p(S) actually depends on other parameters, such as
image content and dimension, but we use D_p(S) for simplicity of notation). Then, D_2 = D_p(S)/(S/G). To avoid overflowing
the RAM buffer, the group transcoding delay must satisfy D_p(S)/(S/G) < G/B_sp, or

D_p(S) < S/B_sp     (Condition A)

[0087] Assuming that Condition A holds true, then the output transcoded groups G_p will be uniformly spaced by a
delay equal to D_1. The transmission channel can send each transcoded group of bits G_p in time D_3 = G_p/B_pc. In case
i), we illustrate D_3(i) < D_1, i.e. each output group G_p can be sent before the next output group is ready for transmission.
In case ii), D_3(ii) > D_1, so that the output transmission link cannot send the produced bits fast enough to keep the output
queue empty. In case ii), the transmission link's output queue grows without bound given a continuous stream of transcoded
bits, causing overflow for finite-length link buffers. Therefore, we desire that the delay caused by the transcoded
output group size satisfy D_3 < D_1. Clearly, D_3 = G_p/B_pc. To avoid overflowing the transmission link's output buffer, the
transcoded output image group size G_p must satisfy G_p/B_pc < G/B_sp, or

c > B_sp/B_pc     (Condition B)

where c = group image compression ratio G/G_p, which we assume to be on average equivalent to the overall image
compression ratio. In summary, the streamed image transcoder should only perform transcoding when both Condition
A and Condition B are satisfied. If the proxy-server link is the bottleneck, i.e. B_sp < B_pc, then Condition B reduces to c
> N, where N is a number less than 1. Normally, the compression ratio is always greater than 1, so Condition B will
always be satisfied. In this case, only Condition A must be satisfied in order for transcoding to not be disadvantageous.
In fact, when the proxy-server link is the bottleneck, Condition B could be interpreted as providing an upper bound on
the ratio of expansion allowed for a transcoded image, namely 1/c < B_pc/B_sp. Expansion of an image may occasionally
be necessary when format conversion is mandatory, e.g. GIF->Palm format. The above equation allows us to determine
when such format conversion will increase the chances of buffer overflow, and when format conversion will not cause
buffer overflow. For example, if B_sp = 1 bps, B_pc = 2 bps, and G = 1 bit, then Condition B says that the output group G_p
can expand to a maximum of 2 bits. If the client-proxy link is the bottleneck, i.e. B_pc < B_sp, then Condition B requires that
the image compression ratio c be greater than the ratio of proxy-server to client-proxy bandwidths in order for
transcoding to be worthwhile. In addition, Condition A must still be satisfied.
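
The two streaming conditions can be checked with a few lines of Python. This is a minimal sketch with hypothetical function names; S is taken in bits, bandwidths in bits per second, and c is the compression ratio G/G_p as defined above.

```python
def satisfies_condition_a(d_p: float, s_bits: float, b_sp: float) -> bool:
    """Condition A: D_p(S) < S / B_sp (transcoder keeps up with the input stream)."""
    return d_p < s_bits / b_sp


def satisfies_condition_b(c: float, b_sp: float, b_pc: float) -> bool:
    """Condition B: c > B_sp / B_pc (output link keeps up with the transcoder)."""
    return c > b_sp / b_pc


def should_stream_transcode(d_p: float, s_bits: float, c: float,
                            b_sp: float, b_pc: float) -> bool:
    """Streamed transcoding is performed only when both conditions hold."""
    return (satisfies_condition_a(d_p, s_bits, b_sp)
            and satisfies_condition_b(c, b_sp, b_pc))
```

Using the example above (B_sp = 1 bps, B_pc = 2 bps), Condition B holds for any c greater than 0.5, that is, for any output group smaller than 2 bits per input bit.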

[0088] It is noted that Condition A and Condition B are tight bounds that assume that the buffer must never be allowed
to overflow. Those skilled in the art recognize that looser constraints may be derived given that images are of finite- 
length, rather than the continuous stream assumed in the analysis. More relaxed constraints would permit more time 
for transcoding and/or allow less aggressive compression. 

[0089] It is thus an aspect of the present invention to provide a method for a transcoding proxy to facilitate browsing 
between client devices and servers connected via a communication network. The method includes receiving an HTTP
GET request from a client device for an object stored at one of the servers, forwarding the GET request for the object
to the server, receiving the object from the server, examining preferences specified by a user of the client device,
examining contents of the object, examining communication network characteristics, choosing a set of transcoding 
parameters, forming a transcoded form of the object, sending the transcoded form to the client, and/or examining 
network characteristics including estimating network bandwidth between the server and the proxy as well as between 
the proxy and the client, and/or estimating delay between the servers and the proxy as well as delay between the proxy 
and the client device, and/or providing a feedback to the user about a level of transcoding performed on the object, 
and/or the step of examining including determining the size of the object, and/or determining dimensions of the object, 
and/or calculating the compression ratio of the object. If the object is of type image, dimensions of the image object 
are determined by area of the image in square pixels, and the compression ratio is determined by the bpp ratio of the 
image object. The present invention allows both store-and-forward and streaming transcoding, thus allowing forming 
a transcoded form before the step of receiving the object from the servers is complete. This method can be applied to 
JPEG, GIF and other image types. Another aspect of this invention is that it allows sending out the transcoded form 
before the step of forming a transcoded form of the object is complete. 

[0090] There are several other considerations that are important. The above examples for the concepts of the present 
invention are usual for image and video, etc. The wide use of the Internet has shown the value of JPEG and MPEG 
compressed image data. Audio coded data also needs to be decompressed, mixed with special sound effects, merged 
with other audio data, edited and processed in the real domain. Similar implementations are performed for other
industrial, commercial, and military applications.

[0091] This invention may also be provided as a process, an article of manufacture, apparatus, system, architecture 
and/or a computer product. For example, it may be implemented as an article of manufacture comprising a computer 
usable medium having computer readable program code means embodied therein for causing a computer to perform 
the methods of the present invention. 

[0092] It is noted that although the description of the invention is made for particular arrangements of steps, the 
intent and concept of the present invention are suitable and applicable to other arrangements. For example, the in- 
vention is also adaptable to any browser although embodiment is directed towards web browsing only. Although primary 
consideration is given to dynamic implementations, the invention may be employed with a combination of static, quasi- 
dynamic and dynamic implementations. 



Claims 



1. A method for a transcoding proxy to facilitate browsing between a plurality of client devices and a plurality of servers
connected via a communication network, the method comprising: 



receiving a request from one of the client devices for an object stored at one of the servers, 
forwarding the request for the object to said one of the servers, 



receiving the object from said one of the servers, 

examining preferences specified by a user of said one of the client devices, 

examining contents of the object, 

examining communication network characteristics, 

choosing a set of transcoding parameters, 

forming a transcoded form of the object, and 

sending the transcoded form to said one of the clients. 

2. A method as recited in claim 1, wherein the network characteristics include bandwidth, and examining network
characteristics includes estimating network bandwidth between said one of the servers and the proxy as well as 
between the proxy and said one of the clients. 

3. A method as recited in claim 1 or claim 2, wherein the network characteristics include delay, and examining
network characteristics includes estimating delay between said one of the servers and the proxy as well as delay
between the proxy and said one of the clients.

4. A method as recited in any preceding claim, further comprising providing feedback to the user about a level of 
transcoding performed on the object to form the transcoded form. 


5. A method as recited in any preceding claim, wherein the step of forming a transcoded form employs dynamic 
adaptation. 

6. A method as recited in any preceding claim, wherein the step of forming a transcoded form is started before the 
step of receiving the object from said one of the servers is complete.

7. A method as recited in any preceding claim, wherein the step of sending out the transcoded form is started after
processing an initial fraction of the received object, and before the step of receiving the image object from said
one of the servers is complete.


8. A method as recited in any preceding claim, wherein the step of sending out the transcoded form is started before
the step of forming a transcoded form of the object is complete. 

9. A transcoding proxy system for facilitating browsing between a plurality of clients and a plurality of servers connected
via a communication network, the proxy comprising:

an HTTP proxy engine to receive a request from one of the clients for an object stored at one of the servers, 
and to fetch the object from said one of the servers, 

an object transcoder to form a transcoded form of the object using a set of parameters for transcoding,

a dynamic policy module to determine the set of parameters of transcoding, 

an image size and delay predictor module to gather characteristics of the object, 


a user preference module to gather quality preferences specified by a user of said one of the clients, and 

a bandwidth estimation module to estimate available network bandwidth, 

wherein said dynamic policy module dynamically adjusts the parameters of transcoding using the input received
from the image size and delay predictor module, user preferences module, and bandwidth estimation
module for the purpose of improving satisfaction for the user, and the transcoding system provides feedback 
to the user about the level of transcoding performed. 
A computer program comprising computer program instructions to cause a computer to perform the steps of the
method as claimed in any of claims 1 to 8.